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Methods and apparatus for binarisina images 
Field of the invention 

The present invention relates to methods for processing an image so as to 
classify pixels of the image based on an intensity threshold. In particular, the 
5 invention relates to such a method having an improved process for selection 
of the threshold. The invention is applicable to both medical and non-medical 
images. 

Background of Invention 

10 Binarisation is a well-known technique for image segmentation - that is 
classifying pixels of the image into two classes. Binarisation performs this 
classification based on whether a given pixel of the image has an intensity 
(gray-level) above or below a threshold. Binarisation has been widely applied 
to a number of image processing and computer vision applications, as a 

15 preliminary segmentation step. It makes an implicit assumption that an object 
of interest in the image has different intensity values from other (background) 
portions of the image. 

Many techniques exist for selection of the threshold. For example, in some 
20 such processes, the threshold can be selected in a process involving user 
interaction, while in other processes the threshold is selected entirely 
automatically. In some such processes the threshold is selected locally (i.e. 
such that the threshold varies from one pixel to another), while in other 
processes the threshold is the same over the whole image. 

25 

Most automatic threshold selection methods employ a histogram of the gray 
levels in the image. For example, Otsu [1] proposed a selection of the 
threshold to maximise the separability of the resultant classes in gray levels, 
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which is performed by minimising the within-class variance. Li and Lee [2] 
selected the threshold by minimising the cross entropy between the image 
and its segmented version. Kittler and lllingworth [3] selected the threshold by 
minimising the Bayes errors under the assumption that the object and pixel 

5 gray level values are normally distributed. Kapur et al [4] provided a maximum 
entropy approach. Wong and Sahoo [5] maximised the entropy with 
constraints on the region homogeneity and object boundary. Saha and Udupa 
[6] proposed a technique which maximised class uncertainty and homogeneity 
of the regions. Cheng et al [7] used the concept of fuzzy c-partition and the 

1 0 maximum fuzzy entropy principle to select a threshold, 

Cheung at al (US5,231,580A, 1993) disclosed an automatic method to 
characterise nerve fibres using local thresholds. It first partitions the entire 
image into sub-images and finds the threshold for each sub-image using a 
15 histogram-based thresholding method. Then, the pixel-wise threshold is 
approximated by interpolating the thresholds of neighbouring subimages. 

Summary of the Invention 

It is observed that the existing methods for selecting a threshold described 
20 above lack a mechanism for incorporating prior knowledge about the images 
to be binarised. 

Thus, the present invention aims to provide a new and useful technique for 
selecting a threshold for binarising an image, and in particular one which 
enables prior knowledge to be explicitly incorporated. 

25 In general terms, the invention proposes firstly that this prior knowledge is 
used to define a region of interest (ROI) in the image, such that the analysis of 
frequency distribution of pixel intensities (represented by a frequency 
histogram) is performed only for pixels in the ROI. Secondly, the invention 
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proposes that the prior knowledge is used to select an intensity range, and 
that only pixels within this intensity range are used to generate the frequency 
distribution from which the threshold is selected. 

These two ideas are in principle separate, but in combination they provide a 
5 highly effective mechanism for incorporating prior knowledge into the 
threshold selection. The advantage is critical whether the image is a medical 
one or not. In particular, a threshold can be found to binarise images which 
exhibits high robustness to imaging artefacts such as gray level 
inhomogeneity and noise. 

10 Specifically, one expression of the invention is a method of binarising an 
image composed of pixels having respective intensity values, the method 
comprising: 

(i) using prior knowledge about the image to derive a region of interest 
within it; 

15 (ii) using prior knowledge about the image to derive an intensity range 

of pixels in the said region of interest; 

(iii) obtaining a frequency distribution of the intensities within the said 
intensity range of pixels within the said region of interest; 

(iv) using the said frequency distribution to derive an intensity 
20 threshold; and 

(v) binarising the image by classifying pixels in the said region of 
interest according to whether their intensities are above or below the said 
intensity threshold. 
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The invention may alternatively be expressed as a computer system which is 
set up to perform such a method. Alternatively, it can be expressed as 
software for performing the method. 

Brief Description of The Figures 

5 Preferred features of the invention will now be described, for the sake of 
illustration only, with reference to the following figures in which: 

Fig. 1 shows the steps in a method which is an embodiment of the 
invention; 

Fig. 2 shows an MR SPGR intercommissural axial slice of a brain, 
10 which is a suitable subject for the method of Fig. 1 ; 

Fig. 3 shows a region of interest within the image of Fig. 2 derived by a 
first step of the method of Fig. 1 ; 

Fig. 4 is a gray-level histogram of the ROI shown in Fig. 3, and a 
threshold selected in one form of a step of the method of Fig. 1; and 
15 Fig. 5 shows the binarised image using the threshold selected in the 

method of Fig. 1. 

Detailed Description of the embodiments 

20 

Referring firstly to Fig. 1, the overall steps of a method which is an 
embodiment of the invention are shown. 

In step 1, an image is input 

25 

In step 2, prior knowledge of the image is used to define a region of interest 
(ROI) which is a subset of the image. This process can be done by whatever 
means, either automatic, semi-automatic, or even manual. 
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In step 3 an analysis is performed on the frequency of occurrence of 
intensities within the ROI, and a range of frequencies is defined, again using 
prior knowledge. 

5 For example, without losing generality, we denote the image to be processed 
as f(x), where f(x) is the gray level at a pixel labelled x. It is further supposed 
that the processed image has L gray levels denoted by r f where / is an integer 
in the range 0 to L-1 and r 0 < r x < ...r M . It is also assumed that the object of 
interest has higher intensity values than the background. Suppose that due to 

10 prior knowledge or test we know that the proportion of the region of Interest 
which is occupied by the object is in the percentage range per 0 to pen. 

Let h(i) denote the frequency of gray level n f and let H(i) denote the 
cumulative frequency which is ^Wi'), where /' is an integer dummy index. 

i'mQ 

15 Considering two values of / written as m and /?, the frequency of intensities in 
the range r m to r n is Thus, we can use pero to calculate a gray level 

n ow , such that we are sure that all the pixels having lower intensity represent 
background. n ow can be written as: 

20 r hw = minfc- 1 H(i) Z per Q }. (1) 

Similarly, we can use pen to calculate a gray level r^igh such that we are sure 
that all pixels having higher intensity represent the object: 

r high =rmn{i\H(i)*per x }. (2) 

25 In a step 4 of the method of Fig 1 , the threshold is selected using an algorithm 
which operates on the frequencies within the selected range from n ow to r h f gh . 
The details of several ways in which this can be carried out within the scope 
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of the invention are given below. Thus, a selected threshold is output in step 
5. 

Image binarisation is then performed using this threshold, to create an image 
5 in which all pixels (at least in the ROI) are classified into two classes. Further 
image processing steps may optionally be performed at this stage. 

We now turn to a discussion of three techniques by which step 4 can be 
carried out 

10 

1. Range-constrained least valley detection method (RCLVD) 

If the frequency range derived in step 3 is correctly estimated then it will 
include a valley in the frequency distribution of intensities. This valley 
15 separates the background and the object. Thus, valley detection can be 
exploited to select the threshold. This has the following steps: 

1) A frequency interval <5fcis specified. 

2) The gray level range [r, ow ,r A , 5A ] is partitioned into K+1 intervals with an equal 
20 frequency ranged. For an interval labelled by integer index / the lower end 

of its intensity range is denoted r x J and the upper end is denoted r( . Thus: 

r? =r low , r 2 ° =rrfa{i\HQ)Z(j>er Q +Sh)} t 

r t l =r 2 °, r\ =rmn{i\H(i)Z(Hti) + Sh)}, 

25 r * = rf~ l . rf = rmn{i \ HQ) Z (H(r x K ) + 
H(r } K + Sh) £ per x and H(r x K ) < per } . 

3) The average frequency h J for each of the intervals j is calculated given by 
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4) Let J denote the interval for which h J is a minimum. The threshold of this 
RCVLD method, which is denoted 0 RCVLD . may be selected to be any value in 

the range r/ to r( , such as 0 RCVLD = [r 2 J +/■/)/ 2. 

5 

2. Range-constrained weighted variance method (RCVW) 

Let r k fall within the range n ow to r Mg h, and suppose that the pixels of the ROI 
are in two classes C* and C 2 , where C1 is pixels of the background class and 
10 consists of pixels with gray levels /w to r k , and C 2 is pixels of the object class 
and is composed of pixels with gray levels r k +1 to r^/,. The range-constrained 
weighted variance method maximises the "weighted between-class variance" 
defined as: 

ORawiW^) = max (Pr(C 1 )D(C 1 )^ 1 +Pr(C 2 )£>(C 2 )^ 2 ), 

15 where Wi and W 2 are two positive constants selected by the user and 
representing the weights of the two respective class variances, Pr(.) denotes 
the class probability, i.e. 

Pr(CJ= j>(0, Pr(C 2 )= J*©. 

and D(Ci) and D(C$ are given by: 

20 D(C 1 ) = C« 0 -// r ) 2 and W^^fo ~// r ) 2 , where /i r = 2>xA(0 f 

Mo* 

/<o = £/xA(0and m = £fxA(Q. 

When IrV/ is bigger than W 2| background homogeneity is emphasised. 

3. Range-constrained fuzzy c-partition thresholding method (RCFCP) 

25 
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This third method is related to the technique used in [7], and the justification 
for it is as given there. In general terms, let A„ /A 0 be the fuzzy sets of fuzzy 
events "background/object" (which denotes a fuzzy partition of the set 
f ..,r hlgh with a membership function /i Ai /ju^ respectively). The probability 
5 of these fuzzy events are given by: 

J°C4) = Pa, U)*hj , where A, e {A b ,A 0 j, and the weighted entropy with this 

J m Qow 

fuzzy partition can be calculated as: 
S(W X ,W 2 ) = W x xP(A b )xlo B P(A b ) + W 2 xP(A 0 )xlo g P(A 0 ) 
where W 1 and W 2 are two positive constants, and log(.) is the natural 
10 logarithm. 

Let r low <a<c< t r hlgh . The membership functions can be defined as follows: 



and 



1, r hw £x<,a 

(x - c) I {a - c) a<x<c 
0 c<x£r Mgh 



(x-a)/(c-a) a<x<c 



0 c<x£r m 
15 The optimum parameters a* and c* are chosen to maximise the entropy 
S(Wf,l/Vy, and the optimum threshold is 0 RCFCP ={a +c*)/2. 

Having now presented the steps of the embodiment in principle, we turn to an 
example of the embodiment in operation. This example uses the form of step 
20 4 referred to above as RCLVD. 



The starting point of the method is the image shown in Fig. 2, an MR 
(Magnetic Resonance) image which is a T1 -weighted or SPGR (spoiled 
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gradient recalled acquisition) axial slice around the intercommissural plane. 
This image is input in step 1 of the method. 

In step 2 of the method, we calculate the pixels enclosed by the skull (i.e. find 
the ROl) using the following steps: the usual histogram-based thresholding 
method is used to binarise the axial slice; a morphological closing operation is 
used to connect small gaps; the largest connected component is identified; 
and the holes within the component are filled. The resulting ROl (the pixels 
enclosed by the skull) is shown in Fig. 3. 

In step 3, the two percentages per 0 and pen are set as 14% and 28%. This 
selection is based on previous experiments and/or other prior knowledge. 

In step 4 of the method (RCLVD), we select the<5feto be 1% (alternatively any 
value in the range 1% to 5% would be suitable). Fig. 4 shows the histogram of 
frequencies in the ROl, and the calculated threshold 0 RCLVD is shown as the 
line indicated. This completes the procedure of the embodiment. 

The output threshold of the method is used as in conventional techniques to 
binarise the image. The binarised image is shown in Fig. 5. 

Although only a single embodiment of the invention has been described, 
many variations are possible within the scope of the invention as will be clear 
to a skilled reader. 
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Claims 

1. A method of binarising an image composed of pixels having respective 
intensity values, the method comprising: 

(i) using prior knowledge about the image to derive a region of interest 
5 within it; 

(ii) using prior knowledge about the image to derive an intensity range 
of pixels in the said region of interest; 

(iii) obtaining a frequency distribution of the intensities within the said 
intensity range of pixels within the said region of interest; 

10 (iv) using the said frequency distribution to derive an intensity 

threshold; and 

(v) binarising the image by classifying pixels in the said region of 
interest according to whether their intensities are above or below the said 
intensity threshold. 

15 2. A method according to claim 1 in which in step (iv), the threshold is 
found by deriving a valley in the frequency distribution within the range, and 
selecting the intensity threshold to correspond to the valley. 

3. A method according to claim 2 in which the valley is found by 
determining the total intensities in a number of intervals defined in the range, 

20 and selecting the intensity threshold as an intensity within the interval having 
the lowest total intensity. 

4. A method according to claim 3 in which the intensity threshold is 
selected as the mid-point of the interval having the lowest total intensity. 



12 



5. A method according to claim 1 in step (iv) the threshold is found by 
minimising a function which is a sum of the variances of the intensities below 
and above the threshold. 

6. A method according to claim 5 in which the sum is a weighted sum 
defined based on two constants Wi and W 2 . 

7. A method according to claim 6 in which, representing labelling the 
possible values of pixel intensity by an integer index / and their respective 
frequencies by h(i), and writing the lower and upper intensities respectively as 
riow and r h igh, the weighted sum is given by 

0*cwv<Wi>W*) = maxPKCiWQW + Pr(C 2 )£>(C 2 )*F 2 ), 

where Pr(.) denotes the class probability (Pr(C;) = and 



P*(C 2 ) = 2^(0). and D(d) and DfC^ are given by: 



D(C X ) = C" 0 - // r ) 2 and £>(C 2 ) = (/i 1 -MtY. where th = £^ x K0 . 



8. A method according to claim 1 in which step (iv) is performed by 
selecting the threshold as a function of parameters which maximise an 
entropy function which indicates the entropy of a fuzzy partition of the pixels 
into classes based on the parameters. 

9. A method of processing an image which includes binarising it by a 
thresholding method according to any preceding claim, and then modifying 
the classification of one or more of the pixels by considering spatial 
relationships between the locations of the classified pixels. 
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10. A computer program product comprising a recording medium and 
programming instructions stored on the recording medium and readable by a 
computer system to cause the computer system to perform a method 
according to any preceding claim. 

5 11. A computer system for binarising an image composed of pixels having 
respective intensity values, the system including: 

(i) at least one data input device for a user to select a region of interest 
in the image and specify a frequency range within the frequency distribution of 
the intensities of pixels in the region of interest; 

10 (ii) a processor arranged to obtain a frequency distribution of the 

intensities within the intensity range of pixels within the region of interest, use 
the frequency distribution to derive an intensity threshold; and binarise the 
image by classifying pixels in the region of interest according to whether their 
intensities are above or below the threshold. 

15 12. A system according to claim 11 in which the processor is arranged to 
derive the threshold by deriving a valley in the frequency distribution within the 
range, and selecting the intensity threshold to correspond to the valley. 

13. A system according to claim 12 in which processor is arranged to find 
the valley by determining the total intensities in a number of intervals defined 

20 in the range, and selecting the intensity threshold as an intensity within the 
interval having the lowest total intensity. 

14. A system according to claim 13 in which the processor is arranged to 
select the intensity threshold as the mid-point of the interval having the lowest 
total intensity. 
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15. A system according to claim 14 in which the processor is arranged to 
select the threshold by minimising a function which is a sum of the variances 
of the intensities below and above the threshold. 

16. A system according to claim 15 in which the sum is a weighted sum 
5 defined based on two constants Wi and W 2 . 

17. A system according to claim 16 in which, representing labelling the 
possible values of pixel intensity by an integer index / and their respective 
frequencies by h(i), and writing the lower and upper intensities respectively as 
n ow and r h ig hl the weighted sum is given by 

10 e RCLWV qv y9 w 2 ) = maxWQW)^ +Pr(c 2 )i)(c 2 )Fr 2 ), 

where Pr(.) denotes the class probability (PrfC^^ J^h(i) and 
Pr(C 2 ) = 2*®)" and D (° 1 ) and D ^ are 9iVen by: 

D(C X ) = (ju 0 - Mr f ancl D(C 2 ) = (a, - ju T f , where p T - £ ' * *(0 . 

15 18. A system according to claim 11 in which the processor is arranged to 
select the threshold as a function of one or more parameters which maximise 
an entropy function which indicates the entropy of a fuzzy partition of the 
pixels into classes based on the parameters. 

19. A system according to any of claims 11 to 18 in which the processor is 
20 further arranged to process the segmented image by modifying the classes to 
which each pixel is allocated by considering relationships between the 
locations of the pixels which have been classified. 
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Abstract 

Methods and apparatus for binarisina images 

5 A method is proposed for binarising an image by deriving an intensity 
threshold and classifying pixels according to whether their intensity is below or 
above the threshold. In the derivation of the threshold, prior knowledge is 
used to define a region of interest (ROI) in the image. Furthermore, prior 
knowledge is used to select a range in the frequency distribution of the 

10 intensities of the pixels in the ROI, and that only data within this frequency 
range is used to derive the threshold. These techniques provide a highly 
effective mechanism for incorporating prior knowledge into the threshold 
selection which is critical whether the image is a medical image or not. In 
particular, a threshold can be found to binarise images which exhibits high 

1 5 robustness to imaging artefacts such as gray level inhomogeneity and noise. 
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Figure 5. 



